Fuzzy Sarsa Learning and the Proof of Existence of Its Stationary Points

نویسندگان

  • Vali Derhami
  • Vahid Johari Majd
  • Majid Nili Ahmadabadi
چکیده

This paper provides a new Fuzzy Reinforcement Learning (FRL) algorithm based on critic-only architecture. The proposed algorithm, called Fuzzy Sarsa Learning (FSL), tunes the parameters of conclusion parts of the Fuzzy Inference System (FIS) online. Our FSL is based on Sarsa, which approximates the Action Value Function (AVF) and is an on-policy method. In each rule, actions are selected according to the proposed modified Softmax action selection so that the final inferred action selection probability in FSL is equivalent to the standard Softmax formula. We prove the existence of fixed points for the proposed Approximate Action Value Iteration (AAVI). Then, we show that FSL satisfies the necessary conditions that guarantee the existence of stationary points for it, which coincide with the fixed points of the AAVI. We prove that the weight vector of FSL with stationary action selection policy converges to a unique value. We also compare by simulation the performance of FSL and Fuzzy Q-Learning (FQL) in terms of learning speed, and action quality. Moreover, we show by another example the convergence of FSL and the divergence of FQL when both algorithms use a stationary policy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ارائه الگوریتم جدید Fuzzy SARSA بهمنظور پیش بینی نوسانات سطح قند خون بیماران مبتلا به دیابت نوع یک

Background: One of the serious complications of type 1 diabetes is a sudden increase and drop in blood glucose levels causing risks of anesthesia and coma. Thus, an important step towards the optimal control of the disease is to use intelligent methods with low error rate and available information in order to predict and prevent such complications. In this paper, a combined Fuzzy SARSA algorith...

متن کامل

Diagonal arguments and fixed points

‎A universal schema for diagonalization was popularized by N.S‎. ‎Yanofsky (2003)‎, ‎based on a pioneering work of F.W‎. ‎Lawvere (1969)‎, ‎in which the existence of a (diagonolized-out and contradictory) object implies the existence of a fixed-point for a certain function‎. ‎It was shown that many self-referential paradoxes and diagonally proved theorems can fit in that schema‎. ‎Here‎, ‎we fi...

متن کامل

The Fuzzy Sars’a’(λ) Learning Approach Applied to a Strategic Route Learning Robot Behaviour

This paper presents a novel Fuzzy Sarsa(λ) Learning (FSλL) approach applied to a strategic route leaning task of a mobile robot. FSλL is a hybrid architecture that combines Reinforcement Learning and Fuzzy Logic control. The Sarsa(λ) Learning algorithm is used to tune the rule-base of a Fuzzy Logic controller which has been tested in a route learning task. The robot explores its environment usi...

متن کامل

Fuzzy Sarsa: An approach to linear function approximation in reinforcement learning

This paper investigates two different approaches to learning using an agent electronic marketplace as test bed. The types of learning considered in this paper include the temporal difference (TD) learning algorithm Sarsa, and two new fuzzified versions of this algorithm, FQ Sarsa and Fuzzy Sarsa. We implement the three learning algorithms in an agent test bed in order to determine their usefuln...

متن کامل

Fixed Fuzzy Points of Fuzzy Mappings in Hausdorff Fuzzy Metric Spaces with Application

Recently, Phiangsungnoen et al. [J. Inequal. Appl. 2014:201 (2014)] studied fuzzy mappings in the framework of Hausdorff fuzzy metric spaces.Following this direction of research, we establish the existence of fixed fuzzy points of fuzzy mappings. An example is given to support the result proved herein; we also present a coincidence and common fuzzy point result. Finally, as an application of ou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008